Abstract — In this work we examine blind adaptive and iterative
decision feedback (DF) receivers for direct sequence code division
multiple access (DS-CDMA) systems in frequency selective channels.
Code-constrained minimum variance (CMV) and constant modulus (CCM)
design criteria for DF receivers based on constrained optimization
techniques are investigated for scenarios subject to multipath.
Computationally efficient blind adaptive stochastic gradient (SG) and
recursive least squares (RLS) algorithms are developed for estimating
the parameters of DF detectors along with successive, parallel and
iterative DF structures. A novel successive parallel arbitrated DF
scheme is presented and combined with iterative techniques for use
with cascaded DF stages in order to mitigate the deleterious effects
of error propagation. Simulation results for an uplink scenario
compare the algorithms and the blind adaptive DF detectors against
linear receivers, and evaluate the error propagation of the new
cancellation techniques against previously reported approaches.



I. Introduction 

CODE division multiple access (CDMA) implemented 
with direct sequence (DS) spread-spectrum signalling 
is amongst the most promising multiple access technologies 
for current and future communication systems. Such services 
include third-generation cellular telephony, indoor wireless 
networks, terrestrial and satellite communication systems. The 
advantages of CDMA include good performance in multi-path 
channels, flexibility in the allocation of channels, increased 
capacity in bursty and fading environments and the ability 
to share bandwidth with narrowband communication systems
without deterioration of either system's performance [1], [2].

Demodulating a desired user in a DS-CDMA network requires
processing the received signal in order to mitigate different types
of interference, namely, narrowband interference (NBI), multi-access
interference (MAI), inter-symbol interference (ISI) and the noise at
the receiver. The major source of interference in most CDMA systems
is MAI, which arises because users communicate through the same
physical channel with non-orthogonal signals. The conventional
(single-user) receiver that employs a filter matched to the signature
sequence does not suppress MAI and is very sensitive to differences
in power between the received signals (near-far problem). Multiuser
detection has been proposed as a means to suppress MAI, increasing
the capacity and the performance of CDMA systems [1], [2]. The
optimal multiuser detector of Verdu [3] suffers from exponential
complexity and requires the knowledge of timing, amplitude and
signature sequences. This fact has motivated the development of
various sub-optimal strategies: the linear [4] and decision feedback
[5] receivers, the successive interference canceller [6] and the
multistage detector [7]. For uplink scenarios, decision feedback
detection, which is relatively simple and performs linear
interference suppression followed by interference cancellation, was
shown to provide substantial gains over linear detection [5], [8],
[10], [11].

When used with short or repeated spreading codes, adaptive signal
processing methods are suitable for CDMA systems because they can
track the highly dynamic conditions often encountered in such systems
due to the mobility of mobile terminals and the random nature of the
channel access. Adaptive techniques can also alleviate the
computational complexity required for parameter estimation. In
particular, blind adaptive signal processing is an interesting
alternative for situations where a receiver loses track of the
desired user and/or a training sequence is not available. In this
context, blind linear receivers for DS-CDMA have been proposed in
recent years to suppress MAI [13], [14], [15], [16], [17], [18].
Blind linear solutions for flat channels were first reported in [13],
where the blind detector was designed on the basis of the minimum
output energy (MOE) or minimum variance (MV) criterion. Following the
initial success of the MV receiver [13], blind receivers using the
constant modulus (CM) criterion, which outperformed their MV
counterparts, were reported in [14], [16] and [17]. In this context,
the work by Tugnait and Li [17] relies on an inverse filtering
criterion and does not exploit the energy contained in the signal
copies available in multipath, leading to performance degradation as
compared to supervised solutions. In order to improve performance and
close the gap between blind and trained solutions, Xu and Tsatsanis
[15] exploited the multipath components through a constrained MV
(CMV) method that treats different signal copies as variables and
jointly optimizes the receiver and channel parameters. Another
solution that outperforms the CMV technique of [15] was proposed by
Xu and Liu [18] for multipath environments, in which constrained
adaptive linear receivers are derived based upon the joint
optimization of channel and receiver parameters in accordance with
the constant modulus criterion. Recently, a code-constrained CM
design for linear receivers and an RLS algorithm, which outperform
previous approaches, were presented in [21] for a downlink scenario.

Although relatively simple, DF structures can perform significantly
better than linear systems. The existing work on blind adaptive DF
receivers was restricted to single-path channel solutions [22], [23],
[24], which have to be modified for multipath. Detectors with DF are
especially interesting because they offer the possibility of
different types of cancellation, namely, successive [8], [9],
parallel [10] and iterative [11], [12], which lead to different
performances and degrees of robustness against error propagation.
This paper addresses blind adaptive DF detection for multipath
channels in DS-CDMA systems based on constrained optimization
techniques using the MV and CM criteria. The CMV and CCM solutions
for the design of blind DF CDMA receivers are presented and then
computationally efficient blind adaptive algorithms are developed for
MAI and intersymbol interference (ISI) suppression and for channel
estimation. The second contribution of this work is a novel
successive parallel arbitrated DF structure based on the recent
concept of parallel arbitration [25]. The new DF detector is then
combined with iterative cascaded DF stages, resulting in an improved
DF receiver structure that is compared with previously reported
methods. Computer simulation experiments show the effectiveness of
the proposed blind DF system for refining soft estimates and
mitigating the effects of error propagation.

This paper is organized as follows. Section II briefly de- 
scribes the DS-CDMA communication system model. The 
constrained decision feedback receivers and the blind channel 
estimation procedure are described in Section III. Section IV 
is devoted to the successive parallel arbitrated and iterative DF 
cancellation techniques, whereas Section V is dedicated to the 
derivation of adaptive SG algorithms and RLS type algorithms. 
Section VI presents and discusses the simulation results and 
Section VII gives the conclusions of this work. 



II. DS-CDMA SYSTEM MODEL 

Let us consider the uplink of a symbol synchronous binary 
phase-shift keying (BPSK) DS-CDMA system with K users, 
N chips per symbol and Lp propagation paths. It should be 
remarked that a synchronous model is assumed for simplicity, 
although it captures most of the features of more realistic 
asynchronous models with small to moderate delay spreads. 
The baseband signal transmitted by the k-th active user to the
base station is given by

$$x_k(t) = A_k \sum_{i=-\infty}^{\infty} b_k(i)\, s_k(t - iT) \tag{1}$$

where $b_k(i) \in \{\pm 1\}$ denotes the i-th symbol for user k, and
the real-valued spreading waveform and the amplitude associated with
user k are $s_k(t)$ and $A_k$, respectively. The spreading waveforms
are expressed by $s_k(t) = \sum_{i=1}^{N} a_k(i)\,\phi(t - iT_c)$,
where $a_k(i) \in \{\pm 1/\sqrt{N}\}$, $\phi(t)$ is the chip waveform,
$T_c$ is the chip duration and $N = T/T_c$ is the processing gain.
Assuming that the receiver is synchronised with the main path,
the coherently demodulated composite received signal is

$$r(t) = \sum_{k=1}^{K} \sum_{l=0}^{L_p-1} h_{k,l}(t)\, x_k(t - \tau_{k,l}) + n(t) \tag{2}$$



where $h_{k,l}(t)$ and $\tau_{k,l}$ are, respectively, the channel
coefficient and the delay associated with the l-th path and the k-th
user. Assuming that $\tau_{k,l} = lT_c$, that the channel is constant
during each symbol interval and that the spreading codes are repeated
from symbol to symbol, the received signal $r(t)$ after filtering by a
chip-pulse matched filter and sampling at chip rate yields the
M-dimensional received vector

$$\mathbf{r}(i) = \sum_{k=1}^{K} \mathbf{H}_k(i) A_k \mathbf{S}_k \mathbf{b}_k(i) + \mathbf{n}(i) \tag{3}$$

where $M = N + L_p - 1$, $\mathbf{n}(i) = [n_1(i) \ldots n_M(i)]^T$ is
the complex Gaussian noise vector with
$E[\mathbf{n}(i)\mathbf{n}^H(i)] = \sigma^2\mathbf{I}$, where $(\cdot)^T$
and $(\cdot)^H$ denote transpose and Hermitian transpose,
respectively, $E[\cdot]$ stands for ensemble average, the user symbol
vector is $\mathbf{b}_k(i) = [b_k(i) \ldots b_k(i - L_s + 1)]^T$, the
amplitude of user k is $A_k$, the channel vector of user k is
$\mathbf{h}_k(i) = [h_{k,0}(i) \ldots h_{k,L_p-1}(i)]^T$, $L_s$ is the
ISI span and the $(L_s N) \times L_s$ block diagonal matrix
$\mathbf{S}_k$ with N-chip shifted versions of the signature of user k
is given by

$$\mathbf{S}_k = \begin{bmatrix} \mathbf{s}_k & \mathbf{0} & \cdots & \mathbf{0} \\ \mathbf{0} & \mathbf{s}_k & \ddots & \vdots \\ \vdots & \ddots & \ddots & \mathbf{0} \\ \mathbf{0} & \cdots & \mathbf{0} & \mathbf{s}_k \end{bmatrix} \tag{4}$$

where $\mathbf{s}_k = [a_k(1) \ldots a_k(N)]^T$ is the signature
sequence for the k-th user and the $M \times (L_s N)$ channel matrix
$\mathbf{H}_k(i)$ for user k is

$$\mathbf{H}_k(i) = \begin{bmatrix} h_{k,0}(i) & \cdots & h_{k,L_p-1}(i) & & \\ & \ddots & & \ddots & \\ & & h_{k,0}(i) & \cdots & h_{k,L_p-1}(i) \end{bmatrix} \tag{5}$$

where $h_{k,l}(i) = h_{k,l}(iT_c)$. The MAI comes from the
non-orthogonality between the received signature sequences,
whereas the ISI span $L_s$ depends on the length of the channel
response, which is related to the length of the chip sequence.
For $L_p = 1$, $L_s = 1$ (no ISI); for $1 < L_p \le N$, $L_s = 2$;
and for $N < L_p \le 2N$, $L_s = 3$.
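The dimension rules stated above can be sketched as two small helpers. This is only an illustration: the function names are hypothetical, and the right-hand interval boundaries are assumed to be inclusive ($L_p = N$ gives $L_s = 2$, $L_p = 2N$ gives $L_s = 3$).

```python
def received_vector_length(n_chips: int, n_paths: int) -> int:
    """Length M = N + L_p - 1 of the chip-rate received vector r(i)."""
    return n_chips + n_paths - 1


def isi_span(n_chips: int, n_paths: int) -> int:
    """ISI span L_s for the three cases listed in the text.

    Assumption: interval boundaries are inclusive on the right.
    """
    if n_chips < 1 or n_paths < 1:
        raise ValueError("N and L_p must be positive integers")
    if n_paths == 1:
        return 1  # single path: no ISI
    if n_paths <= n_chips:
        return 2  # 1 < L_p <= N
    if n_paths <= 2 * n_chips:
        return 3  # N < L_p <= 2N
    raise ValueError("the text only covers L_p <= 2N")
```

For instance, a system with N = 31 chips and L_p = 6 paths would give M = 36 and an ISI span of 2 under these assumptions.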
Fig. 1. Block diagram of a blind multiuser decision feedback receiver.



III. Blind Decision Feedback Constrained 
Receivers 

Let us describe the design of synchronous blind decision feedback
constrained detectors, such as the one shown in Fig. 1. It should be
remarked that portions of the material presented here were presented
in [19]. Consider the received vector $\mathbf{r}(i)$, and let us
introduce the $M \times L_p$ constraint matrix $\mathbf{C}_k$ that
contains one-chip shifted versions of the signature sequence for
user k:

$$\mathbf{C}_k = \begin{bmatrix} a_k(1) & & \mathbf{0} \\ \vdots & \ddots & a_k(1) \\ a_k(N) & & \vdots \\ \mathbf{0} & \ddots & a_k(N) \end{bmatrix} \tag{6}$$
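As an illustration of the structure in (6), the following minimal sketch builds the constraint matrix from a signature and combines it with a channel vector. The function names are hypothetical, the example is real-valued, and the signature is left unnormalized for readability (the paper uses chips in $\{\pm 1/\sqrt{N}\}$).

```python
def constraint_matrix(signature, n_paths):
    """Return the M x L_p matrix whose l-th column is the signature
    delayed by l chips, with M = N + L_p - 1."""
    n = len(signature)
    m = n + n_paths - 1
    c = [[0.0] * n_paths for _ in range(m)]
    for col in range(n_paths):
        for chip, value in enumerate(signature):
            c[col + chip][col] = value
    return c


def effective_signature(c, channel):
    """Product C_k h_k: the signature as seen through the multipath channel."""
    return [sum(row[l] * channel[l] for l in range(len(channel))) for row in c]
```

With signature [1, -1, 1] and two paths, the matrix has four rows, and the product with the channel [1, 0.5] is the convolution of signature and channel, which is the quantity the multipath constraints act on.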



The input to the hard decision device, depicted in Fig. 1,
corresponding to the i-th symbol is

$$\mathbf{z}(i) = \mathbf{W}^H(i)\mathbf{r}(i) - \mathbf{F}^H(i)\hat{\mathbf{b}}(i) \tag{7}$$

where the input is $\mathbf{z}(i) = [z_1(i) \ldots z_K(i)]^T$,
$\mathbf{W}(i) = [\mathbf{w}_1 \ldots \mathbf{w}_K]$ is the
$M \times K$ feedforward matrix, and
$\hat{\mathbf{b}}(i) = [\hat{b}_1(i) \ldots \hat{b}_K(i)]^T$ is the
$K \times 1$ vector of estimated symbols, which are fed back through
the $K \times K$ feedback matrix
$\mathbf{F}(i) = [\mathbf{f}_1(i) \ldots \mathbf{f}_K(i)]$. Generally,
the DF receiver design is equivalent to determining for user k a
feedforward filter $\mathbf{w}_k(i)$ with M elements and a feedback
filter $\mathbf{f}_k(i)$ with K elements that provide an estimate of
the desired symbol:

$$z_k(i) = \mathbf{w}_k^H(i)\mathbf{r}(i) - \mathbf{f}_k^H(i)\hat{\mathbf{b}}(i), \qquad k = 1, 2, \ldots, K \tag{8}$$

where $\hat{\mathbf{b}}(i) = \operatorname{sgn}[\Re(\mathbf{W}^H(i)\mathbf{r}(i))]$
is the vector with initial decisions provided by the linear section,
and $\mathbf{w}_k$ and $\mathbf{f}_k$ are optimized by the MV or the CM
cost functions, subject to a set of multipath constraints given by
$\mathbf{C}_k^H\mathbf{w}_k(i) = \mathbf{h}_k(i)$ for the MV case, or
$\mathbf{C}_k^H\mathbf{w}_k(i) = \nu\,\mathbf{h}_k(i)$ for the CM case,
where $\nu$ is a constant to ensure the convexity of the CM-based
receiver and $\mathbf{h}_k(i)$ is the k-th user channel vector. In
particular, the feedback filter $\mathbf{f}_k(i)$ of user k has a
number of non-zero coefficients corresponding to the available number
of feedback connections for each type of cancellation structure. The
final detected symbol is obtained with:



$$\hat{b}_k^{(f)}(i) = \operatorname{sgn}\big(\Re[z_k(i)]\big) = \operatorname{sgn}\big(\Re[\mathbf{w}_k^H(i)\mathbf{r}(i) - \mathbf{f}_k^H(i)\hat{\mathbf{b}}(i)]\big) \tag{9}$$

where $\Re(\cdot)$ selects the real part and $\operatorname{sgn}(\cdot)$
is the signum function. For successive DF (S-DF) [8], the $K \times K$
matrix $\mathbf{F}(i)$ is strictly lower triangular, whereas for
parallel DF (P-DF) [10], [11] $\mathbf{F}(i)$ is full and constrained
to have zeros on the main diagonal in order to avoid cancelling the
desired symbols. The S-DF structure is optimal in the sense that it
achieves the sum capacity of the synchronous CDMA channel with AWGN
[9]. In addition, the S-DF scheme is less affected by error
propagation, although it generally does not provide uniform
performance over the user population, which is a desirable
characteristic for uplink scenarios. In this context, the P-DF system
can offer uniform performance over the users but it suffers from
error propagation. In order to design the DF receivers and satisfy
the constraints of S-DF and P-DF structures, the designer must obtain
the vector with initial decisions
$\hat{\mathbf{b}}(i) = \operatorname{sgn}[\Re(\mathbf{W}^H(i)\mathbf{r}(i))]$
and then resort to the following cancellation approach. The non-zero
part of the filter $\mathbf{f}_k$ corresponds to the number of used
feedback connections and to the users to be cancelled. For the S-DF,
the number of feedback elements and their associated number of
non-zero filter coefficients in $\mathbf{f}_k$ (where k goes from the
second detected user to the last one) range from 1 to $K - 1$. For the
P-DF, the feedback connections used and their associated number of
non-zero filter coefficients in $\mathbf{f}_k$ are equal to $K - 1$
for all users and the matrix $\mathbf{F}(i)$ has zeros on the main
diagonal to avoid cancelling the desired symbols.
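The two feedback structures can be visualized through the sparsity pattern of F(i). The sketch below is illustrative only; the function names are hypothetical and the masks merely mark which entries of F(i) are allowed to be non-zero (column k corresponds to the feedback filter of user k).

```python
def sdf_mask(n_users):
    """S-DF: strictly lower triangular pattern (1 marks a free coefficient)."""
    return [[1 if row > col else 0 for col in range(n_users)]
            for row in range(n_users)]


def pdf_mask(n_users):
    """P-DF: full pattern with zeros on the main diagonal."""
    return [[0 if row == col else 1 for col in range(n_users)]
            for row in range(n_users)]


def nonzeros_per_filter(mask):
    """Number of free coefficients in each column, i.e. in each f_k."""
    n_users = len(mask)
    return [sum(mask[row][col] for row in range(n_users))
            for col in range(n_users)]
```

For K = 4, the S-DF pattern leaves between 0 and K - 1 = 3 free coefficients per filter, while the P-DF pattern leaves exactly K - 1 = 3 for every user, which matches the counts given in the text.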

In what follows, constrained CM and MV design criteria for 
DF detectors are presented. The CMV design for DF receivers 
generalizes the work on linear structures of Xu and Tsatsanis 
[15], whereas the CCM design is proposed here for both linear
and DF schemes. 

A. DF Constrained Constant Modulus (DF-CCM) Receivers 

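To describe the DF-CCM receiver design let us consider the
CM cost function:

$$J_{CM}(i) = E\big[(|\mathbf{w}_k^H(i)\mathbf{r}(i) - \mathbf{f}_k^H(i)\hat{\mathbf{b}}(i)|^2 - 1)^2\big] = E\big[(|z_k(i)|^2 - 1)^2\big] \tag{10}$$

subject to $\mathbf{C}_k^H\mathbf{w}_k(i) = \nu\,\mathbf{h}_k(i)$, where
$z_k(i) = \mathbf{w}_k^H(i)\mathbf{r}(i) - \mathbf{f}_k^H(i)\hat{\mathbf{b}}(i)$.
Assuming that the channel vector $\mathbf{h}_k$ is known, let us
consider the unconstrained cost function

$$J'_{CM}(i) = E\big[(|\mathbf{w}_k^H(i)\mathbf{r}(i) - \mathbf{f}_k^H(i)\hat{\mathbf{b}}(i)|^2 - 1)^2\big] + 2\Re\big[(\mathbf{C}_k^H\mathbf{w}_k(i) - \nu\,\mathbf{h}_k(i))^H\boldsymbol{\lambda}\big],$$

where $\boldsymbol{\lambda}$ is a vector of complex Lagrange
multipliers. The function $J'_{CM}(i)$ is minimized with respect to
$\mathbf{w}_k(i)$ and $\mathbf{f}_k(i)$ under the set of constraints
$\mathbf{C}_k^H\mathbf{w}_k(i) = \nu\,\mathbf{h}_k(i)$. Taking the
gradient terms of $J'_{CM}(i)$ with respect to $\mathbf{w}_k(i)$ and
setting them to zero we have

$$\nabla J'_{CM}(i) = 2E\big[(|z_k(i)|^2 - 1)\,\mathbf{r}(i)\big(\mathbf{r}^H(i)\mathbf{w}_k(i) - \hat{\mathbf{b}}^H(i)\mathbf{f}_k(i)\big)\big] + 2\mathbf{C}_k\boldsymbol{\lambda} = \mathbf{0}.$$

Rearranging the terms we obtain

$$E\big[|z_k(i)|^2\mathbf{r}(i)\mathbf{r}^H(i)\big]\mathbf{w}_k(i) = E\big[z_k^*(i)\mathbf{r}(i)\big] + E\big[|z_k(i)|^2\mathbf{r}(i)\hat{\mathbf{b}}^H(i)\big]\mathbf{f}_k(i) - \mathbf{C}_k\boldsymbol{\lambda}$$

and consequently
$\mathbf{w}_k(i) = \mathbf{R}_k^{-1}(i)[\mathbf{d}_k(i) + \mathbf{T}_k(i)\mathbf{f}_k(i) - \mathbf{C}_k\boldsymbol{\lambda}]$,
where $\mathbf{R}_k(i) = E[|z_k(i)|^2\mathbf{r}(i)\mathbf{r}^H(i)]$,
$\mathbf{T}_k(i) = E[|z_k(i)|^2\mathbf{r}(i)\hat{\mathbf{b}}^H(i)]$,
$\mathbf{d}_k(i) = E[z_k^*(i)\mathbf{r}(i)]$ and the asterisk denotes
complex conjugation. Using the constraint
$\mathbf{C}_k^H\mathbf{w}_k(i) = \nu\,\mathbf{h}_k(i)$ we arrive at the
expression for the Lagrange multiplier

$$\boldsymbol{\lambda} = \big(\mathbf{C}_k^H\mathbf{R}_k^{-1}(i)\mathbf{C}_k\big)^{-1}\big(\mathbf{C}_k^H\mathbf{R}_k^{-1}(i)\mathbf{T}_k(i)\mathbf{f}_k(i) + \mathbf{C}_k^H\mathbf{R}_k^{-1}(i)\mathbf{d}_k(i) - \nu\,\mathbf{h}_k(i)\big).$$

By substituting $\boldsymbol{\lambda}$ into
$\mathbf{w}_k(i) = \mathbf{R}_k^{-1}(i)[\mathbf{d}_k(i) + \mathbf{T}_k(i)\mathbf{f}_k(i) - \mathbf{C}_k\boldsymbol{\lambda}]$
we obtain the solution for the feedforward section of the DF-CCM
receiver:

$$\mathbf{w}_k(i) = \mathbf{R}_k^{-1}(i)\Big[\mathbf{d}_k(i) + \mathbf{T}_k(i)\mathbf{f}_k(i) - \mathbf{C}_k\big(\mathbf{C}_k^H\mathbf{R}_k^{-1}(i)\mathbf{C}_k\big)^{-1}\big(\mathbf{C}_k^H\mathbf{R}_k^{-1}(i)\mathbf{T}_k(i)\mathbf{f}_k(i) + \mathbf{C}_k^H\mathbf{R}_k^{-1}(i)\mathbf{d}_k(i) - \nu\,\mathbf{h}_k(i)\big)\Big] \tag{11}$$

where the expression in (11) is a function of previous values of
$\mathbf{w}_k(i)$ and the channel $\mathbf{h}_k(i)$. To obtain the CCM
solution for the parameter vector $\mathbf{f}_k$ of the feedback
section, we compute the gradient terms of $J'_{CM}$ with respect to
$\mathbf{f}_k$ and by setting them to zero we have

$$\nabla J'_{CM}(i) = 2E\big[(|z_k(i)|^2 - 1)\,\hat{\mathbf{b}}(i)\big(\mathbf{r}^H(i)\mathbf{w}_k(i) - \hat{\mathbf{b}}^H(i)\mathbf{f}_k(i)\big)\big] = \mathbf{0}.$$

Rearranging the terms we get
$E[|z_k(i)|^2\hat{\mathbf{b}}(i)\hat{\mathbf{b}}^H(i)]\mathbf{f}_k(i) = E[|z_k(i)|^2\hat{\mathbf{b}}(i)\mathbf{r}^H(i)]\mathbf{w}_k(i) - E[z_k^*(i)\hat{\mathbf{b}}(i)]$
and consequently we have

$$\mathbf{f}_k(i) = \mathbf{I}_k^{-1}(i)\big[\mathbf{T}_k^H(i)\mathbf{w}_k(i) - \mathbf{v}_k(i)\big] \tag{12}$$

where $\mathbf{I}_k(i) = E[|z_k(i)|^2\hat{\mathbf{b}}(i)\hat{\mathbf{b}}^H(i)]$
and $\mathbf{v}_k(i) = E[z_k^*(i)\hat{\mathbf{b}}(i)]$.
We remark that (11) and (12) should be iterated in order to
estimate the desired user symbols. The CCM linear receiver
solution proposed in [21] is obtained by making
$\mathbf{f}_k(i) = \mathbf{0}$ in (11). An analysis of the CCM method in
Appendix I examines its convergence properties for the linear
receiver case, extending previous results on its convexity for both
complex and multipath signals. Since the optimization of the
CCM cost function for a linear receiver
($\mathbf{f}_k(i) = \mathbf{0}$) is a convex optimization, as shown in
Appendix I, it provides a good starting point for performing the
cancellation of the associated users by the feedforward section of
the DF-CCM receiver.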

B. DF Constrained Minimum Variance (DF-CMV) Receivers 

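The DF-CMV receiver design resembles the DF-CCM design and
considers the following cost function:

$$J_{MV}(i) = E\big[|\mathbf{w}_k^H(i)\mathbf{r}(i) - \mathbf{f}_k^H(i)\hat{\mathbf{b}}(i)|^2\big] \tag{13}$$

subject to $\mathbf{C}_k^H\mathbf{w}_k(i) = \mathbf{h}_k(i)$. Given the
channel vector $\mathbf{h}_k(i)$, let us consider the unconstrained
cost function

$$J'_{MV}(i) = E\big[|\mathbf{w}_k^H(i)\mathbf{r}(i) - \mathbf{f}_k^H(i)\hat{\mathbf{b}}(i)|^2\big] + 2\Re\big[(\mathbf{C}_k^H\mathbf{w}_k(i) - \mathbf{h}_k(i))^H\boldsymbol{\lambda}\big],$$

where $\boldsymbol{\lambda}$ is a vector of complex Lagrange
multipliers, and minimize $J'_{MV}(i)$ with respect to
$\mathbf{w}_k(i)$ and $\mathbf{f}_k(i)$ under the set of constraints
$\mathbf{C}_k^H\mathbf{w}_k(i) = \mathbf{h}_k(i)$. By taking the
gradient terms of $J'_{MV}(i)$ with respect to $\mathbf{w}_k(i)$ and
setting them to zero we have

$$\nabla J'_{MV}(i) = E\big[\mathbf{r}(i)\big(\mathbf{r}^H(i)\mathbf{w}_k(i) - \hat{\mathbf{b}}^H(i)\mathbf{f}_k(i)\big)\big] + 2\mathbf{C}_k\boldsymbol{\lambda} = \mathbf{0}.$$

Rearranging the terms we obtain
$E[\mathbf{r}(i)\mathbf{r}^H(i)]\mathbf{w}_k(i) = E[\mathbf{r}(i)\hat{\mathbf{b}}^H(i)]\mathbf{f}_k(i) - 2\mathbf{C}_k\boldsymbol{\lambda}$
and consequently
$\mathbf{w}_k(i) = \mathbf{R}^{-1}(i)[\mathbf{T}(i)\mathbf{f}_k(i) - 2\mathbf{C}_k\boldsymbol{\lambda}]$,
where the covariance matrix is
$\mathbf{R}(i) = E[\mathbf{r}(i)\mathbf{r}^H(i)]$ and
$\mathbf{T}(i) = E[\mathbf{r}(i)\hat{\mathbf{b}}^H(i)]$. Using the
constraint $\mathbf{C}_k^H\mathbf{w}_k(i) = \mathbf{h}_k(i)$ we arrive
at the expression for the Lagrange multiplier

$$\boldsymbol{\lambda} = \big(\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{C}_k\big)^{-1}\big(\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{T}(i)\mathbf{f}_k(i) - \mathbf{h}_k(i)\big)/2.$$

By substituting $\boldsymbol{\lambda}$ into
$\mathbf{w}_k(i) = \mathbf{R}^{-1}(i)[\mathbf{T}(i)\mathbf{f}_k(i) - 2\mathbf{C}_k\boldsymbol{\lambda}]$
we obtain the solution for the feedforward section of the DF-CMV
receiver:

$$\mathbf{w}_k(i) = \mathbf{R}^{-1}(i)\Big[\mathbf{T}(i)\mathbf{f}_k(i) - \mathbf{C}_k\big(\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{C}_k\big)^{-1}\big(\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{T}(i)\mathbf{f}_k(i) - \mathbf{h}_k(i)\big)\Big] \tag{14}$$

Next, we compute the gradient terms of $J'_{MV}(i)$ with respect to
$\mathbf{f}_k(i)$ and set them to zero to get

$$\nabla J'_{MV}(i) = E\big[\hat{\mathbf{b}}(i)\big(\mathbf{r}^H(i)\mathbf{w}_k(i) - \hat{\mathbf{b}}^H(i)\mathbf{f}_k(i)\big)\big] = \mathbf{0}.$$

Rearranging the terms we have
$E[\hat{\mathbf{b}}(i)\hat{\mathbf{b}}^H(i)]\mathbf{f}_k(i) = E[\hat{\mathbf{b}}(i)\mathbf{r}^H(i)]\mathbf{w}_k(i)$
and consequently we obtain

$$\mathbf{f}_k(i) = \mathbf{B}^{-1}(i)\mathbf{T}^H(i)\mathbf{w}_k(i) \tag{15}$$

where $\mathbf{B}(i) = E[\hat{\mathbf{b}}(i)\hat{\mathbf{b}}^H(i)]$. At
this point, the designer can avoid the inversion of $\mathbf{B}(i)$
by using a judicious approximation, that is
$\mathbf{B}(i) = E[\hat{\mathbf{b}}(i)\hat{\mathbf{b}}^H(i)] \approx \mathbf{I}$,
which holds unless the error rate is high. Hence, the feedback
section filter can be designed as
$\mathbf{f}_k(i) \approx \mathbf{T}^H(i)\mathbf{w}_k(i)$. It should also
be noted that by making $\mathbf{f}_k(i) = \mathbf{0}$ we arrive at the
solution of Xu and Tsatsanis in [15].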
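As a quick consistency check, using only the quantities defined in this subsection, premultiplying the feedforward solution (14) by $\mathbf{C}_k^H$ recovers the multipath constraint, since the product $\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{C}_k$ cancels against its own inverse:

$$\mathbf{C}_k^H\mathbf{w}_k(i) = \mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{T}(i)\mathbf{f}_k(i) - \underbrace{\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{C}_k\big(\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{C}_k\big)^{-1}}_{=\,\mathbf{I}}\big(\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{T}(i)\mathbf{f}_k(i) - \mathbf{h}_k(i)\big) = \mathbf{h}_k(i).$$

The same argument applied to (11) yields $\mathbf{C}_k^H\mathbf{w}_k(i) = \nu\,\mathbf{h}_k(i)$ for the DF-CCM design.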

C. Blind Channel Estimation 

The solutions for the CCM and CMV DF receivers assume
the knowledge of the channel parameters. However, in applications
where multipath is present these parameters are not known and thus
channel estimation is required. To blindly estimate the channel we
use the method of [15], [26]:

$$\hat{\mathbf{h}}_k(i) = \arg\min_{\mathbf{h}_k} \mathbf{h}_k^H\mathbf{C}_k^H\mathbf{R}^{-p}(i)\mathbf{C}_k\mathbf{h}_k \tag{16}$$

subject to $\|\mathbf{h}_k\| = 1$, where p is an integer, and whose
solution is the eigenvector corresponding to the minimum eigenvalue
of the $L_p \times L_p$ matrix
$\mathbf{C}_k^H\mathbf{R}^{-p}(i)\mathbf{C}_k$. For the CCM receiver
we employ $\mathbf{R}_k(i)$ in lieu of $\mathbf{R}(i)$ (used for the
CMV) for channel estimation. The use of $\mathbf{R}_k(i)$ instead of
$\mathbf{R}(i)$ avoids the estimation of both $\mathbf{R}(i)$ and
$\mathbf{R}_k(i)$, and shows no performance loss as verified in our
studies and explained in Appendix IV. We restrict p to 1, even though
the performance of the channel estimator and consequently of the
receiver can be improved by increasing p.
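The minimum-eigenvector solution of (16) can be illustrated with a plain power iteration. This is a minimal sketch under stated assumptions, not the adaptive SG or RLS estimators used in the paper: the input matrix `a` stands in for $\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{C}_k$ and is assumed real, symmetric and positive semidefinite, the spectral shift by the trace is a choice made here for simplicity, and all names are hypothetical.

```python
import math


def mat_vec(a, x):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in a]


def min_eigenvector(a, iterations=200):
    """Unit-norm eigenvector of the smallest eigenvalue of a symmetric
    positive semidefinite matrix, via power iteration on trace(a)*I - a."""
    n = len(a)
    shift = sum(a[j][j] for j in range(n))  # trace >= largest eigenvalue
    b = [[(shift if r == c else 0.0) - a[r][c] for c in range(n)]
         for r in range(n)]
    # Non-uniform start so it is unlikely to be orthogonal to the target.
    x = [1.0 / (j + 1) for j in range(n)]
    norm = math.sqrt(sum(v * v for v in x))
    x = [v / norm for v in x]
    for _ in range(iterations):
        y = mat_vec(b, x)
        norm = math.sqrt(sum(v * v for v in y))
        if norm == 0.0:
            break  # degenerate case (e.g. 1 x 1 input): keep current x
        x = [v / norm for v in y]
    return x
```

The shift turns the smallest eigenvalue of `a` into the largest eigenvalue of the shifted matrix, so the ordinary power method converges to the desired direction. The result has unit norm, matching the constraint in (16), and is determined only up to a sign.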

IV. Successive Parallel Arbitrated and Iterative 
DF Detection 

In this section, we present novel iterative techniques, which
are based on the recently introduced concept of parallel
arbitration [25], and combine them with iterative cascaded DF
stages [11], [12]. The motivation for the novel DF structures
is to mitigate the effects of error propagation often found
in P-DF structures [11], [12], which are of great interest for
uplink scenarios due to their capability of providing uniform
performance over the users. The basic idea is to improve the
S-DF structure using parallel searches and then combine it
with an iterative technique, where the second stage uses a
P-DF system to equalize the performance of the users.

A. Successive Parallel Arbitrated DF Detection 

The idea of parallel arbitration is to employ successive
interference cancellation (SIC) to rapidly converge to a local
maximum of the likelihood function and, by running parallel
branches of SIC with different orders of cancellation, one can
arrive at sufficiently different local maxima [25]. In order to
obtain the benefits of parallel search, the candidates should
be arbitrated, yielding different estimates of a symbol. The
estimate of a symbol that has the highest likelihood is then
selected at the output.

Unlike the work of Barriac and Madhow [25] that employed
matched filters as the starting point, we adopt blind DF
receivers as the initial condition. The concept of parallel
arbitration is thus incorporated into a DF detector structure
that applies linear interference suppression followed by SIC
and yields improved starting points as compared to matched
filters. It is also worth noting that our approach does not
require regeneration, as occurs with the original PASIC in [25],
because the blind adaptive filters automatically compute the
coefficients for interference cancellation. A block diagram of
the proposed scheme, denoted successive parallel arbitrated
decision feedback (SPA-DF), is shown in Fig. 2.




Fig. 2. Block diagram of the proposed blind SPA-DF receiver.

Following the schematics of Fig. 2, the user k output of the
parallel branch l (l = 1, ..., L) for the SPA-DF receiver
structure is given by:

$$z_k^{(l)}(i) = \mathbf{w}_k^H(i)\mathbf{r}(i) - [\mathbf{M}_l\mathbf{F}(i)]_k^H\hat{\mathbf{b}}(i) \tag{17}$$

where the vector with initial decisions is
$\hat{\mathbf{b}}(i) = \operatorname{sgn}[\Re(\mathbf{W}^H(i)\mathbf{r}(i))]$,
$[\cdot]_k$ denotes the k-th column of the argument, and the matrices
$\mathbf{M}_l$ are permutated square identity ($\mathbf{I}_K$)
matrices with dimension K whose structures for an L = 4-branch
SPA-DF scheme are given by:

$$\mathbf{M}_1 = \mathbf{I}_K, \quad \mathbf{M}_2 = \begin{bmatrix} \mathbf{0}_{3K/4,K/4} & \mathbf{I}_{3K/4} \\ \mathbf{I}_{K/4} & \mathbf{0}_{K/4,3K/4} \end{bmatrix}, \quad \mathbf{M}_3 = \begin{bmatrix} \mathbf{0}_{K/2,K/2} & \mathbf{I}_{K/2} \\ \mathbf{I}_{K/2} & \mathbf{0}_{K/2,K/2} \end{bmatrix}, \quad \mathbf{M}_4 = \begin{bmatrix} 0 & \cdots & 1 \\ \vdots & ⋰ & \vdots \\ 1 & \cdots & 0 \end{bmatrix} \tag{18}$$

where $\mathbf{0}_{m,n}$ denotes an $m \times n$-dimensional matrix
full of zeros and the structures of the matrices $\mathbf{M}_l$
correspond to phase shifts regarding the cancellation order of the
users. Indeed, the purpose of the matrices in (18) is to change the
order of cancellation. When $\mathbf{M}_l = \mathbf{I}$ the order of
cancellation is a simple successive cancellation (S-DF) based upon
the user powers (the same as [8], [9]). Specifically, the above
matrices perform the cancellation with the following order with
respect to user powers: $\mathbf{M}_1$ with $1, \ldots, K$;
$\mathbf{M}_2$ with $K/4, K/4 + 1, \ldots, K, 1, \ldots, K/4 - 1$;
$\mathbf{M}_3$ with $K/2, K/2 + 1, \ldots, K, 1, \ldots, K/2 - 1$;
$\mathbf{M}_4$ with $K, \ldots, 1$ (reverse order). For more branches,
additional phase shifts are applied with respect to user cancellation
ordering. It is also worth noting that different update orders have
been tried, although they did not result in performance improvements.
For the proposed SPA-DF, the number of feedback elements used and
their associated number of non-zero filter coefficients in
$\mathbf{f}_k$ (where k goes from the second detected user to the last
one) range from 1 to $K - 1$ according to the branch l and the matrix
$\mathbf{M}_l$.
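The four cancellation orders listed above can be generated as follows. This is a sketch that follows the orderings exactly as stated in the text (users indexed from 1 in order of power, K assumed divisible by 4); the function names are hypothetical.

```python
def cancellation_orders(n_users):
    """Cancellation orders of the L = 4 branches, as listed in the text."""
    if n_users % 4 != 0:
        raise ValueError("this sketch assumes K is a multiple of 4")
    users = list(range(1, n_users + 1))

    def shifted(start):
        # Cyclic shift of the order so that it begins at user `start`.
        idx = start - 1
        return users[idx:] + users[:idx]

    return [
        users,                   # M_1: 1, ..., K (plain S-DF order)
        shifted(n_users // 4),   # M_2: K/4, ..., K, 1, ..., K/4 - 1
        shifted(n_users // 2),   # M_3: K/2, ..., K, 1, ..., K/2 - 1
        users[::-1],             # M_4: K, ..., 1 (reverse order)
    ]


def permutation_matrix(order):
    """K x K 0/1 matrix whose row j selects user order[j]."""
    n_users = len(order)
    return [[1 if order[row] == col + 1 else 0 for col in range(n_users)]
            for row in range(n_users)]
```

For K = 8 the branches cancel in the orders 1..8, then 2..8 followed by 1, then 4..8 followed by 1..3, and finally 8..1.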

The final output of the SPA-DF detector chooses the best
estimate among the L candidates as described by:

$$\hat{b}_k^{(f)}(i) = \operatorname{sgn}\Big[\Re\Big(\arg\max_{1 \le l \le L} \big|\Re\big(z_k^{(l)}(i)\big)\big|\Big)\Big] \tag{19}$$

where the selected estimate is the one with the largest real
magnitude, and the resulting decisions form the vector of final
decisions
$\hat{\mathbf{b}}^{(f)}(i) = [\hat{b}_1^{(f)}(i) \ldots \hat{b}_K^{(f)}(i)]^T$.
The number of parallel branches L that yield detection candidates is
a parameter that must be chosen by the designer. Our studies and
computer simulations indicate that L = 4 achieves most of the gains
of the proposed structure and offers a good trade-off between
performance and complexity. In terms of complexity, the SPA-DF system
employs the same filters, namely $\mathbf{W}(i)$ and $\mathbf{F}(i)$,
as the traditional S-DF and requires additional arithmetic operations
to compute the parallel arbitrated candidates. As occurs with
S-DF receivers, a disadvantage of the SPA-DF detector is that
it generally does not provide uniform performance over the
user population. Specifically, in a scenario with tight power
control successive techniques tend to favor the last detected
users, resulting in non-uniform performance. To equalize the
performance of the users an iterative technique with multiple
stages can be used.
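The arbitration rule described above (keep, for each user, the branch candidate with the largest real magnitude and slice it) can be sketched as follows. The function name and the data layout are hypothetical, and sgn(0) is taken as +1 by assumption.

```python
def arbitrate(candidates):
    """Select the final hard decisions from L branch outputs.

    candidates[l][k] is the soft output of branch l for user k
    (real or complex). Returns one decision in {+1, -1} per user.
    """
    n_users = len(candidates[0])
    decisions = []
    for k in range(n_users):
        # Candidate with the largest magnitude of its real part.
        best = max((branch[k] for branch in candidates),
                   key=lambda z: abs(z.real))
        decisions.append(1 if best.real >= 0 else -1)
    return decisions
```

With two branches producing [0.2+0.1j, 0.9-0.3j] and [-1.1, -0.3], the first user takes the second branch (real part -1.1) and the second user takes the first branch (real part 0.9), so the final decisions are -1 and +1.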

B. Iterative Successive Parallel Arbitrated DF Detection 

In [11], Woodward et al. presented an iterative detector
with an S-DF in the first stage and P-DF or S-DF structures,
with users being demodulated in reverse order, in the second
stage. The work of [11] was then extended to account for
coded systems and training-based reduced-rank filters [12].
Differently from [11], [12], we focus on blind adaptive receivers
and uncoded systems, and combine the proposed SPA-DF structure
with iterative detection. An iterative receiver with hard-decision
feedback is defined by the recursion:

$$\mathbf{z}^{(m+1)}(i) = \mathbf{W}^H(i)\mathbf{r}(i) - \mathbf{F}^H(i)\hat{\mathbf{b}}^{(m)}(i) \tag{20}$$

where the filters $\mathbf{W}$ and $\mathbf{F}$ can be S-DF or P-DF
structures, and $\hat{\mathbf{b}}^{(m)}(i)$ is the vector of tentative
decisions from the preceding iteration, where we have:

$$\hat{\mathbf{b}}^{(1)}(i) = \operatorname{sgn}\big(\Re[\mathbf{W}^H(i)\mathbf{r}(i)]\big) \tag{21}$$

$$\hat{\mathbf{b}}^{(m)}(i) = \operatorname{sgn}\big(\Re[\mathbf{z}^{(m)}(i)]\big), \qquad m > 1 \tag{22}$$

where the number of stages m depends on the application.
Additional stages can be added where the order of the users
is reversed from stage to stage.
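The hard-decision recursion described above can be sketched for real-valued signals as follows. The function names are hypothetical, the filters are assumed to be given (no adaptation is shown), and the toy numbers in the note below are chosen only to show a second stage correcting a first-stage error.

```python
def sgn(x):
    """Signum with sgn(0) taken as +1 (an assumption of this sketch)."""
    return 1 if x >= 0 else -1


def iterative_df(w, f, r, stages):
    """Decisions after `stages` stages of hard-decision feedback.

    w: M x K feedforward matrix (list of rows), f: K x K feedback
    matrix (list of rows), r: length-M received vector; all real.
    """
    n_users = len(f)
    # Linear section W^T r, shared by every stage.
    linear = [sum(w[m][u] * r[m] for m in range(len(r)))
              for u in range(n_users)]
    b = [sgn(v) for v in linear]  # initial decisions
    for _ in range(stages - 1):
        # Subtract the fed-back decisions: z = W^T r - F^T b.
        z = [linear[u] - sum(f[j][u] * b[j] for j in range(n_users))
             for u in range(n_users)]
        b = [sgn(v) for v in z]
    return b
```

With W equal to the 2 x 2 identity, F = [[0, 0.6], [0.3, 0]] and r = [0.7, 0.1], the first stage decides [1, 1], while the second stage cancels the stronger user from the weaker one and returns [1, -1].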

To equalize the performance over the user population, we
consider the two-stage structure shown in Fig. 3. The first stage
is an SPA-DF scheme with filters $\mathbf{W}^1$ and $\mathbf{F}^1$.
The tentative decisions are passed to the second stage, which
consists of an S-DF or a P-DF detector with filters $\mathbf{W}^2$
and $\mathbf{F}^2$. The users in the second stage are demodulated
successively and in reverse order relative to the first branch of
the SPA-DF structure (a conventional S-DF). The resulting iterative
receiver system is denoted ISPAS-DF when an S-DF scheme is deployed
in the second stage, whereas when P-DF filters are used in the second
stage the overall scheme is called ISPAP-DF. The output of the second
stage of the resulting scheme is expressed by:

$$z_j^{(2)}(i) = [\mathbf{M}\mathbf{W}^2(i)]_j^H\,\mathbf{r}(i) - [\mathbf{M}\mathbf{F}^2(i)]_j^H\,\hat{\mathbf{b}}^{(2)}(i) \tag{23}$$

where $z_j$ is the j-th component of the soft output vector
$\mathbf{z}$, $\mathbf{M}$ is a square permutation matrix with ones
along the reverse diagonal and zeros elsewhere (similar to
$\mathbf{M}_4$ in (18)), $[\cdot]_j$ denotes the j-th column of the
argument (a matrix), and
$\hat{b}_j^{(m)}(i) = \operatorname{sgn}[\Re(z_j^{(m)}(i))]$. Note that
additional stages can be included or the SPA-DF scheme can be used
in the second stage, even though our studies indicate that the gains
in performance are marginal. Hence, the two-stage structure is
adopted for the rest of this work. It should also be remarked
that, due to the difficulty of theoretically analyzing parallel
arbitrated and iterative schemes, our analysis in Section VI
is mainly focused on computer simulation experiments. A
theoretical analysis of iterative DF schemes constitutes an open
topic which is beyond the scope of this paper.

Fig. 3. Block diagram of the two-stage DF receiver with SPA-DF scheme
in the first stage. The second stage can employ S-DF or P-DF structures to
demodulate users in reverse order relative to the first branch of the first stage,
which uses S-DF detection.

V. Adaptive Algorithms

In this section we describe stochastic gradient (SG) and
recursive least squares (RLS) algorithms for the blind estimation
of the channel and of the feedforward and feedback sections
of DF receivers using the CM and MV criteria along with
constrained optimization techniques, as illustrated in Fig. 1.
The CMV-based algorithms are extensions for DF detection
of the techniques proposed by Xu and Tsatsanis in [15]. The
CCM-SG recursions represent an extension of the work of [18]
for complex signals and DF receivers, whereas the CCM-RLS
algorithms are novel for both linear and DF structures.

It should be emphasized that the SG solutions presented in
this section differ from those reported in a previous work [20]
in the sense that the blind channel estimation is decoupled
from the feedforward and feedback recursions. Indeed, we
adopt the SG blind channel estimation reported in [27] that
has been shown to outperform the one proposed in [15]. Our
studies also reveal that when the system deals with high loads
(K is large) and the performance is poorer, a decoupled SG
blind channel estimator, such as [27], is significantly less
affected than the approach that optimizes $\mathbf{w}_k$,
$\mathbf{f}_k$ and $\mathbf{h}_k$ as in [20]. In addition, the
deployment of the SG blind estimator of [27] with SG CCM-based
algorithms considerably improves its performance, because blind
channel estimators that rely on the CM criterion show poor
performance and depend on other methods for initialization, as
pointed out in [18].

In terms of performance, RLS recursions have the potential
to achieve good performance independently of the spread of
the eigenvalues of the input signal autocorrelation matrix, have
faster convergence performance, show superior performance
under fast frequency selective fading channels and can cope
with larger systems [29] than SG techniques.

In terms of complexity, SG algorithms require a number of
operations that grows linearly with M and additional users
in order to suppress MAI and ISI and estimate the channel [27],
whereas RLS techniques have quadratic complexity for MAI and ISI
suppression and channel estimation.

A. Stochastic Gradient and RLS Blind Channel Estimation

The channel estimate $\hat{\mathbf{h}}_k(i)$ is obtained through the
power method and the SG and RLS techniques described in [27].
The methods are SG and RLS adaptive versions of the blind
channel estimation algorithms described in (16) and introduced
in [27]. The SG recursion requires only $O(L_p^2)$ arithmetic
operations to estimate the channel, against $O(L_p^3)$ of its SVD
version. For the RLS version, the SVD on the $L_p \times L_p$ matrix
$\mathbf{C}_k^H\mathbf{R}^{-1}(i)\mathbf{C}_k$, as stated in (16) and
that requires $O(L_p^3)$, is avoided and replaced by a single
matrix-vector multiplication, resulting in the reduction of the
corresponding computational complexity by one order of magnitude and
no performance loss. For the CCM-RLS algorithms, $\mathbf{R}_k$ can be
employed instead of $\mathbf{R}$ (used for the CMV) for channel
estimation to avoid the estimation of both $\mathbf{R}$ and
$\mathbf{R}_k$. The use of $\mathbf{R}_k$ instead of $\mathbf{R}$
shows no performance loss as verified in our studies and
is explained in Appendix IV.


B. Constrained Constant Modulus Stochastic Gradient (CCM-SG) Algorithm

An SG solution to (10) and (11) can be devised by using
instantaneous estimates and taking the gradient terms with
respect to $w_k(i)$ and $f_k(i)$, which should adaptively minimize
$J_{CM}$ with respect to $w_k(i)$ and $f_k(i)$. The recursions of [27]
are used to obtain channel estimates. If we consider the set
of constraints $C_k^H w_k(i) = h_k(i)$, we arrive at the update
equations for the estimation of $w_k(i)$ and $f_k(i)$:

$$w_k(i+1) = P_k\big(w_k(i) - \mu_w e_k(i) z_k^*(i) r(i)\big) + C_k (C_k^H C_k)^{-1} h_k(i) \qquad (24)$$

$$f_k(i+1) = f_k(i) - \mu_f e_k(i) z_k^*(i) \hat{b}(i) \qquad (25)$$

where $z_k(i) = w_k^H(i) r(i) - f_k^H(i)\hat{b}(i)$, $e_k(i) = (|z_k(i)|^2 - 1)$
and $P_k = I - C_k (C_k^H C_k)^{-1} C_k^H$ is a matrix that projects
the receiver's parameters onto another hyperplane in order to
ensure the constraints.
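The role of the projection in (24) can be checked numerically. The following is a minimal sketch, not the authors' implementation: it assumes a linear receiver (feedback filter set to zero), a hypothetical 3-chip signature, a two-path channel estimate and random data, and it only verifies that one update of the form (24) leaves $C_k^H w_k(i+1) = h_k(i)$ satisfied.

```python
import random

def hermitian(A):
    """Conjugate transpose of a matrix stored as a list of rows."""
    return [[A[r][c].conjugate() for r in range(len(A))] for c in range(len(A[0]))]

def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[r][k] * B[k][c] for k in range(len(B))) for c in range(len(B[0]))]
            for r in range(len(A))]

def inv2(A):
    """Inverse of a 2x2 matrix (enough for the two-path toy channel used here)."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [[A[1][1] / det, -A[0][1] / det], [-A[1][0] / det, A[0][0] / det]]

def ccm_sg_feedforward_step(w, r, C, h, mu_w):
    """One update of the form (24) for a linear receiver (feedback filter set to zero)."""
    Ch = hermitian(C)
    CG = matmul(C, inv2(matmul(Ch, C)))          # C (C^H C)^{-1}
    z = matmul(hermitian(w), r)[0][0]            # z = w^H r
    e = abs(z) ** 2 - 1.0                        # constant-modulus error
    g = [[w[m][0] - mu_w * e * z.conjugate() * r[m][0]] for m in range(len(w))]
    in_span = matmul(CG, matmul(Ch, g))          # part of g removed by P = I - C (C^H C)^{-1} C^H
    restore = matmul(CG, h)                      # C (C^H C)^{-1} h re-imposes the constraint
    return [[g[m][0] - in_span[m][0] + restore[m][0]] for m in range(len(w))]

random.seed(0)
s = [1.0, -1.0, 1.0]                             # hypothetical 3-chip signature
C = [[complex(s[0]), 0j], [complex(s[1]), complex(s[0])],
     [complex(s[2]), complex(s[1])], [0j, complex(s[2])]]   # shifted copies: M = 4, L_p = 2
h = [[0.8 + 0.1j], [0.3 - 0.2j]]                 # hypothetical channel estimate
w = [[complex(random.gauss(0, 1), random.gauss(0, 1))] for _ in range(4)]
r = [[complex(random.gauss(0, 1), random.gauss(0, 1))] for _ in range(4)]

w_new = ccm_sg_feedforward_step(w, r, C, h, mu_w=0.01)
constraint = matmul(hermitian(C), w_new)
residual = max(abs(constraint[k][0] - h[k][0]) for k in range(2))
print("max |C^H w(i+1) - h| =", residual)
```

The residual stays at rounding level whatever the starting vector, because $P_k C_k = 0$ removes any component of the gradient step along the columns of $C_k$ and the second term of (24) restores the constrained part.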

It is worth noting that, for stability and to facilitate tuning of
parameters, it is useful to employ normalized step sizes when
operating in a changing environment. A normalized version of
this algorithm can be devised by substituting (24) and (25) into
the CM cost function, differentiating the cost function with
respect to $\mu_w$ and $\mu_f$, setting it to zero and solving the new
equations, as detailed in Appendix II. Hence, the normalized
CCM-SG algorithm proposed here adopts variable step size
mechanisms described by

$$\mu_w = \mu_{0_w} \frac{|z_k(i)| - \mu_f |z_k(i)| e_k(i) \hat{b}^H(i)\hat{b}(i) + 1}{|z_k(i)| e_k(i) r^H(i) P_k r(i)}, \qquad \mu_f = \mu_{0_f} \frac{|z_k(i)| - \mu_w |z_k(i)| e_k(i) r^H(i) P_k r(i) + 1}{|z_k(i)| e_k(i) \hat{b}^H(i)\hat{b}(i)}$$

where $\mu_{0_w}$ and $\mu_{0_f}$ are the convergence factors for $w_k$ and $f_k$,
respectively.
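As a quick algebraic check of the normalized feedforward step size, the sketch below takes the simplified a posteriori output used in Appendix II, $z_k(1 - \mu_w e_k r^H P_k r - \mu_f e_k \hat{b}^H\hat{b})$, as given and confirms that, with a unit convergence factor, the chosen step size drives the a posteriori modulus to one. All numerical values are hypothetical, $\mu_f$ is treated as a fixed input, and the formula requires $e_k \neq 0$.

```python
def normalized_mu_w(z_abs, e, rPr, bb, mu_f, mu_0w=1.0):
    """Feedforward step size with the (|z| + 1) numerator, scaled by mu_0w."""
    return mu_0w * (z_abs + 1.0 - mu_f * z_abs * e * bb) / (z_abs * e * rPr)

def a_posteriori_output(z, e, rPr, bb, mu_w, mu_f):
    """Output after one update, following the simplified expression of Appendix II."""
    return z * (1.0 - mu_w * e * rPr - mu_f * e * bb)

z = 1.2 + 0.5j            # hypothetical a priori output, |z| = 1.3
e = abs(z) ** 2 - 1.0     # CM error e_k = |z_k|^2 - 1 (must be nonzero)
rPr, bb = 7.5, 3.0        # hypothetical values of r^H P_k r and b^H b
mu_f = 0.05               # hypothetical feedback step size

mu_w = normalized_mu_w(abs(z), e, rPr, bb, mu_f)
z_post = a_posteriori_output(z, e, rPr, bb, mu_w, mu_f)
print("mu_w =", mu_w, " |z_post| =", abs(z_post))
```

With $\mu_{0_w} < 1$ the update stops short of the unit-modulus point, which is the reason the text introduces the convergence factors.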

C. Constrained Minimum Variance Stochastic Gradient (CMV-SG) Algorithm

An SG solution to (13) and (14) can be developed in an
analogous form to the previous section by taking the gradient
terms with respect to $w_k(i)$ and $f_k(i)$. The recursions in [27]
are used again to obtain channel estimates. The update rules
for the estimation of the parameters of the feedforward and
feedback sections of the DF receiver are:

$$w_k(i+1) = P_k\big(w_k(i) - \mu_w z_k^*(i) r(i)\big) + C_k (C_k^H C_k)^{-1} h_k(i) \qquad (26)$$

$$f_k(i+1) = f_k(i) - \mu_f z_k^*(i) \hat{b}(i) \qquad (27)$$

A normalized version of this algorithm can also be obtained
by substituting (26) and (27) into the MV cost function,
differentiating it with respect to $\mu_w$ and $\mu_f$, setting it to
zero and solving the new equations, as described in Appendix III.
Hence, $\mu_w = \mu_{0_w} / \big(r^H(i) P_k r(i)\big)$ and
$\mu_f = \mu_{0_f} \big(1 - \mu_w r^H(i) P_k r(i)\big) / \big(\hat{b}^H(i)\hat{b}(i)\big)$.

D. Constrained Constant Modulus RLS (CCM-RLS) Algorithm

Given the expressions for the feedforward ($w_k$) and feedback
($f_k$) sections in (11) and (12) of the blind DF receiver,
we need to estimate $R_k^{-1}(i)$, $I_k^{-1}(i)$ and $(C_k^H R_k^{-1}(i) C_k)^{-1}$
recursively to reduce the computational complexity required to
invert these matrices. Using the matrix inversion lemma and
Kalman RLS recursions [29] we have:

$$G_k(i) = \frac{\alpha^{-1} R_k^{-1}(i-1) z_k^*(i) r(i)}{1 + \alpha^{-1} r^H(i) z_k(i) R_k^{-1}(i-1) z_k^*(i) r(i)} \qquad (28)$$

$$R_k^{-1}(i) = \alpha^{-1} R_k^{-1}(i-1) - \alpha^{-1} G_k(i) z_k(i) r^H(i) R_k^{-1}(i-1) \qquad (29)$$

and

$$V(i) = \frac{\alpha^{-1} I_k^{-1}(i-1) z_k^*(i) \hat{b}(i)}{1 + \alpha^{-1} \hat{b}^H(i) z_k(i) I_k^{-1}(i-1) z_k^*(i) \hat{b}(i)} \qquad (30)$$

$$I_k^{-1}(i) = \alpha^{-1} I_k^{-1}(i-1) - \alpha^{-1} V(i) z_k(i) \hat{b}^H(i) I_k^{-1}(i-1) \qquad (31)$$

where $0 < \alpha < 1$ is the forgetting factor. The algorithm can be
initialized with $R_k^{-1}(0) = \delta I$ and $I_k^{-1}(0) = \delta I$ where $\delta$ is a
scalar to ensure numerical stability. Once $R_k^{-1}(i)$ is updated,
we employ another recursion to estimate $(C_k^H R_k^{-1}(i) C_k)^{-1}$
as described by:

$$\Gamma_k^{-1}(i) = \frac{\Gamma_k^{-1}(i-1)}{1-\alpha} - \frac{\Gamma_k^{-1}(i-1) \gamma_k(i) \gamma_k^H(i) \Gamma_k^{-1}(i-1)}{\frac{(1-\alpha)^2}{\alpha} + (1-\alpha) \gamma_k^H(i) \Gamma_k^{-1}(i-1) \gamma_k(i)} \qquad (32)$$

where $\Gamma_k(i)$ is an estimate of $(C_k^H R_k^{-1}(i) C_k)$ and $\gamma_k(i) = C_k^H r(i) z_k(i)$.
The RLS channel estimation procedure described in [27], with $\Gamma_k$ in lieu of $\Theta_k$,
is employed for estimating $h_k$, saving computational resources and resulting
in no performance loss for channel estimation. Finally, we
construct the DF-CCM receiver as described by:



Cf R-i (i) tfe (i) + Cf R-i (i)d, (i) - 1, hfe (i) 



(33) 



(34) 



where $d_k(i)$ is estimated by $d_k(i+1) = \alpha d_k(i) + (1-\alpha) z_k^*(i) r(i)$,
$T_k(i+1) = \alpha T_k(i) + (1-\alpha) \hat{b}(i) r^H(i) |z_k(i)|^2$
and $v_k(i+1) = \alpha v_k(i) + (1-\alpha) z_k^*(i) \hat{b}(i)$. In terms of
computational complexity, the CCM-RLS algorithm requires
$O(M^2)$ (feedforward section) and $O(K^2)$ (feedback section)
to suppress MAI and ISI and $O(L_p^2)$ to estimate the channel,
against $O(M^3)$, $O(K^3)$ and $O(L_p^3)$ required by (11), (12) and
(16), respectively.
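The recursion (32) has the form of a rank-one matrix inversion lemma update. The sketch below is an illustration under an explicit assumption that is not stated in the text: that the underlying matrix evolves as $\Gamma_k(i) = (1-\alpha)\Gamma_k(i-1) + \alpha\gamma_k(i)\gamma_k^H(i)$. Under that assumption the recursive inverse should match the directly formed matrix, which the code checks for a hypothetical 2x2 case.

```python
def hermitian(A):
    """Conjugate transpose of a matrix stored as a list of rows."""
    return [[A[r][c].conjugate() for r in range(len(A))] for c in range(len(A[0]))]

def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[r][k] * B[k][c] for k in range(len(B))) for c in range(len(B[0]))]
            for r in range(len(A))]

def inv2(A):
    """Inverse of a 2x2 matrix, used only to start the recursion in this toy case."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [[A[1][1] / det, -A[0][1] / det], [-A[1][0] / det, A[0][0] / det]]

def gamma_inverse_update(G_inv, gamma, alpha):
    """Rank-one update of Gamma^{-1} in the form of (32); gamma is an n x 1 column."""
    gH = hermitian(gamma)
    Gg = matmul(G_inv, gamma)                    # Gamma^{-1}(i-1) gamma(i)
    gG = matmul(gH, G_inv)                       # gamma^H(i) Gamma^{-1}(i-1)
    denom = (1 - alpha) ** 2 / alpha + (1 - alpha) * matmul(gH, Gg)[0][0]
    n = len(G_inv)
    return [[G_inv[a][c] / (1 - alpha) - Gg[a][0] * gG[0][c] / denom
             for c in range(n)] for a in range(n)]

alpha = 0.9                                                     # hypothetical value
Gamma_prev = [[2.0 + 0j, 0.5 + 0.5j], [0.5 - 0.5j, 1.5 + 0j]]   # hypothetical Hermitian matrix
gamma = [[0.7 - 0.2j], [-0.4 + 0.9j]]                           # hypothetical vector C^H r z

outer = matmul(gamma, hermitian(gamma))
# Assumed underlying recursion: Gamma(i) = (1 - alpha) Gamma(i-1) + alpha gamma gamma^H
Gamma_new = [[(1 - alpha) * Gamma_prev[a][c] + alpha * outer[a][c] for c in range(2)]
             for a in range(2)]
G_inv_new = gamma_inverse_update(inv2(Gamma_prev), gamma, alpha)

prod = matmul(G_inv_new, Gamma_new)
err = max(abs(prod[a][c] - (1.0 if a == c else 0.0)) for a in range(2) for c in range(2))
print("max deviation from identity:", err)
```

Only matrix-vector products are needed per update, which is where the quadratic rather than cubic cost quoted above comes from.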

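E. Constrained Minimum Variance RLS (CMV-RLS) Algorithm

Similarly to the CCM-RLS, the expressions for the DF-CMV
receiver given in (14) and (15) are employed, and
the matrices $R^{-1}(i)$, $B^{-1}(i)$ and $(C_k^H R^{-1}(i) C_k)^{-1}$ are
recursively estimated with the aid of the matrix inversion
lemma in order to reduce the computational complexity, as
given by:

$$G(i) = \frac{\alpha^{-1} R^{-1}(i-1) r(i)}{1 + \alpha^{-1} r^H(i) R^{-1}(i-1) r(i)} \qquad (35)$$

$$R^{-1}(i) = \alpha^{-1} R^{-1}(i-1) - \alpha^{-1} G(i) r^H(i) R^{-1}(i-1) \qquad (36)$$

and

$$Q(i) = \frac{\alpha^{-1} B^{-1}(i-1) \hat{b}(i)}{1 + \alpha^{-1} \hat{b}^H(i) B^{-1}(i-1) \hat{b}(i)} \qquad (37)$$

$$B^{-1}(i) = \alpha^{-1} B^{-1}(i-1) - \alpha^{-1} Q(i) \hat{b}^H(i) B^{-1}(i-1) \qquad (38)$$

where $0 < \alpha < 1$ is the forgetting factor. The algorithm can
be initialized with $R^{-1}(0) = \delta I$ and $B^{-1}(0) = \delta I$ where
$\delta$ is a positive constant. Once $R^{-1}(i)$ is updated, we employ
another recursion to estimate $(C_k^H R^{-1}(i) C_k)^{-1}$ as described
by:

$$\Theta_k^{-1}(i) = \frac{\Theta_k^{-1}(i-1)}{1-\alpha} - \frac{\Theta_k^{-1}(i-1) \theta_k(i) \theta_k^H(i) \Theta_k^{-1}(i-1)}{\frac{(1-\alpha)^2}{\alpha} + (1-\alpha) \theta_k^H(i) \Theta_k^{-1}(i-1) \theta_k(i)} \qquad (39)$$

where $\Theta_k(i)$ is an estimate of $(C_k^H R^{-1}(i) C_k)$ and $\theta_k(i) = C_k^H r(i)$.
For estimating the channel $h_k$, the RLS algorithm
described in [27] is employed. Finally, we construct the DF-CMV
receiver as given by: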

Wfe(z) = R-i(«) ft(*)4(z) - Cfe0^i(z)x 



(40) 



g 



(41) 



where $T(i+1) = \alpha T(i) + (1-\alpha) \hat{b}(i) r^H(i)$. It should be
remarked that the approximation on $B$, that is $I \approx E[\hat{b}\hat{b}^H]$,
can be used when the error rate is low in order to avoid the
matrix computations in (37) and (38). Otherwise, in the case
of moderate to high error rate, it is preferable to employ (37)
and (38) in order to guarantee adequate performance of the
algorithm.
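The gain-vector recursions (35) and (36) can be illustrated with a short numerical sketch. It assumes, as is standard for exponentially weighted RLS but not spelled out above, that the matrix being tracked is $R(i) = \alpha R(i-1) + r(i) r^H(i)$ with $R(0) = \delta^{-1} I$; the dimension, forgetting factor, $\delta$ and the random input vectors are hypothetical. The check is that the recursively updated inverse times the directly accumulated matrix stays close to the identity.

```python
import random

def rls_inverse_update(R_inv, r, alpha):
    """Update R^{-1}(i-1) to R^{-1}(i) with the gain-vector form of (35) and (36)."""
    n = len(r)
    Rr = [sum(R_inv[a][c] * r[c] for c in range(n)) for a in range(n)]               # R^{-1}(i-1) r(i)
    rHR = [sum(r[a].conjugate() * R_inv[a][c] for a in range(n)) for c in range(n)]  # r^H(i) R^{-1}(i-1)
    rHRr = sum(r[a].conjugate() * Rr[a] for a in range(n))                           # r^H R^{-1} r
    # Gain vector: a^{-1} R^{-1} r / (1 + a^{-1} r^H R^{-1} r), written with a common factor
    gain = [x / (alpha + rHRr) for x in Rr]
    return [[(R_inv[a][c] - gain[a] * rHR[c]) / alpha for c in range(n)] for a in range(n)]

random.seed(1)
M, alpha, delta = 4, 0.998, 0.1     # hypothetical size, forgetting factor and initialization
R_inv = [[complex(delta) if a == c else 0j for c in range(M)] for a in range(M)]   # R^{-1}(0) = delta I
R = [[complex(1 / delta) if a == c else 0j for c in range(M)] for a in range(M)]   # matching R(0)

for _ in range(50):
    r = [complex(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(M)]
    # Assumed direct recursion for the weighted covariance: R(i) = alpha R(i-1) + r r^H
    R = [[alpha * R[a][c] + r[a] * r[c].conjugate() for c in range(M)] for a in range(M)]
    R_inv = rls_inverse_update(R_inv, r, alpha)

prod = [[sum(R_inv[a][k] * R[k][c] for k in range(M)) for c in range(M)] for a in range(M)]
err = max(abs(prod[a][c] - (1.0 if a == c else 0.0)) for a in range(M) for c in range(M))
print("max deviation from identity after 50 updates:", err)
```

The same function applies to (37) and (38) with $\hat{b}(i)$ in place of $r(i)$ and $B^{-1}$ in place of $R^{-1}$.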

VI. Simulations 

In this section, we evaluate the performance of the iterative
arbitrated DF structures introduced in Section IV and the
blind adaptive algorithms presented in Section V. Due to the
extreme difficulty of theoretically analyzing such schemes, we
adopt a simulation approach and conduct several experiments
in order to verify the effectiveness of the proposed techniques.
In particular, we have carried out experiments under
stationary and non-stationary scenarios to assess convergence
performance in terms of the bit error rate (BER) of the
proposed structure and algorithms and compared them with
other recently reported algorithms and structures. Moreover,
the BER performance of the receivers employing the analyzed
techniques is assessed for different loads, channel paths ($L_p$)
and profiles, and fading rates. The DS-CDMA system employs
Gold sequences of length $N = 31$.

Because we focus on uplink scenarios, users experience
different channels. All channels assume that $L_p = 6$ as an
upper bound. We use three-path channels with relative powers
$p_{k,l}$ given by 0, -3 and -6 dB, where in each run and for each
user the second path delay ($\tau_2$) is given by a discrete uniform
random variable (r.v.) between 1 and 4 chips and the third path
delay is taken from a discrete uniform r.v. between 1 and $5 - \tau_2$
chips. It is also assumed here that the channels experienced
by different users are statistically independent and identically
distributed. The sequence of channel coefficients for each user
$k$ ($k = 1, \ldots, K$) is $h_{k,l}(i) = p_{k,l}\alpha_{k,l}(i)$ ($l = 0, 1, 2, \ldots$),
where $\alpha_{k,l}(i)$ is a complex Gaussian random sequence obtained
by passing complex white Gaussian noise through
a filter with approximate transfer function $c/\sqrt{1 - (f/f_d)^2}$,
where $c$ is a normalization constant, $f_d = v/\lambda$ is the maximum
Doppler shift, $\lambda$ is the wavelength of the carrier frequency, and
$v$ is the speed of the mobile [30]. This procedure corresponds
to the generation of independent sequences of correlated unit
power complex Gaussian random variables ($E[|\alpha_{k,l}^2(i)|] = 1$)
with the path weights $p_{k,l}$ normalized so that $\sum_{l=1}^{L_p} p_{k,l}^2 = 1$.
The phase ambiguity derived from the blind channel estimation
method in [27] is eliminated in our simulations by using the
phase of $g(0)$ as a reference to remove the ambiguity and, for
fading channels, we assume ideal phase tracking and express
the results in terms of the normalized Doppler frequency $f_dT$
(cycles/symbol). Alternatively, differential modulation can be
used to account for the phase rotations.
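The channel profile described above can be sketched in a few lines. This is an illustrative reading rather than the authors' simulator: it assumes that the dB values are path powers whose amplitude weights are scaled so that the squared weights sum to one, that the third-path delay is measured from the second path (so the total delay never exceeds 5 chips), and that the fading gains are fixed at unity instead of being produced by the Doppler filter. The function names are hypothetical.

```python
import random

def path_weights(powers_db):
    """Amplitude weights p_l from relative powers in dB, scaled so that sum(p_l^2) = 1."""
    linear = [10 ** (p / 10) for p in powers_db]
    total = sum(linear)
    return [(x / total) ** 0.5 for x in linear]

def draw_delays(rng, max_delay=5):
    """Delays in chips for a three-path channel with a first path at delay 0."""
    tau2 = rng.randint(1, 4)                      # second path: uniform on {1, ..., 4}
    step = rng.randint(1, max_delay - tau2)       # uniform on {1, ..., 5 - tau2}
    return [0, tau2, tau2 + step]                 # assumption: third delay counted from the second path

def channel_vector(weights, delays, gains, Lp=6):
    """Place the weighted path gains on an Lp-tap chip-spaced channel vector."""
    h = [0j] * Lp
    for p, d, a in zip(weights, delays, gains):
        h[d] = p * a
    return h

rng = random.Random(1)
weights = path_weights([0.0, -3.0, -6.0])
delays = draw_delays(rng)
h = channel_vector(weights, delays, gains=[1 + 0j] * 3)   # unit gains: no fading in this sketch
print("weights:", weights)
print("delays :", delays)
print("h      :", h)
```

In a fading run the unit gains would be replaced by the filtered complex Gaussian sequences $\alpha_{k,l}(i)$, drawn independently for each user and path.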

In the following experiments, we indicate the type of
adaptive algorithm used (SG or RLS), the design criterion
(CCM or CMV) and the structure (linear (L) or decision
feedback (DF)). For linear receivers (L) and their algorithms
we make $f_k(i) = 0$ and $\mu_f = 0$. Amongst the analyzed DF
structures, we consider:

• S-DF: the successive DF detector of [7], [8].

• P-DF: the parallel DF detector of [10], [11].

• ISS-DF: the iterative system of Woodward et al. [11] with
S-DF in the first and second stages.

• ISP-DF: the iterative system of Woodward et al. [11] with
S-DF in the first stage and P-DF in the second stage.

• SPA-DF: the proposed successive parallel arbitrated
receiver.

• ISPAS-DF: the proposed iterative detector with the novel 
SPA-DF in the first stage and the S-DF in the second 
stage. 

• ISPAP-DF: the proposed iterative receiver with the SPA- 
DF in the first stage and the P-DF in the second stage. 

For the CCM based algorithms, we employ $\nu = 1$ in
order to ensure convexity. The results are averaged over
200 experiments and the parameters of the algorithms are
optimized for each scenario. We stress that the results are
shown in Figs. 4 to 8 in terms of the
average BER amongst the $K$ users in the system, except for
Figs. 9 and 10, where the individual BER performance of each
user is shown.

A. BER Convergence Performance 

In what follows, we assess the average BER convergence
performance of the analyzed adaptive DF receiver techniques
and algorithms. The BER convergence performance of the
receivers is shown for SG and RLS algorithms in Figs. 4
and 5, respectively. We consider a non-stationary scenario,
where the system starts with $K = 8$ users and at time
$i = 800$, 4 additional users enter the system, totalling $K = 12$
users, and the blind adaptive algorithms are subject to new
interferers/users in the environment. For the sake of comparison,
we also include the curves for supervised NLMS and
RLS [29] adaptive algorithms, which are trained with 200
symbols provided by a pilot channel (at $i = 1, \ldots, 200$ and
$i = 801, \ldots, 1000$) and then switch to decision-directed mode.
It is assumed that the system has ideal power control and that
the signals of the different users reach the base station with the
same average $E_b/N_0$. Note that given the performance of
current power control algorithms, ideal power control is not
far from a realistic situation.

The algorithms for DF receivers are initialized with a
feedforward filter equal to the signature sequence and a
feedback filter with zeros and they gradually adapt in order
to cancel the interference. Note that they do not lock to an
undesired user because of the blind channel estimation that
allows the receiver to use the effective signature sequence. The
results indicate that the CCM design criterion is superior to
the CMV approach, for both SG and RLS algorithms. Another
conclusion from the curves in Figs. 4 and 5 is that CCM-based
blind algorithms achieve a performance very close to
the trained algorithms, leading to significant gains in spectral
efficiency, since no pilot channel is needed. Regarding the
structures of the receivers, we note
that DF receivers are significantly better than linear detectors.
In fact, we attack the problem of the receivers presented in
[15], [16] that operate well in lightly loaded systems, but do
not perform well in moderate and heavily loaded situations, by
cancelling the interferers with the DF section. In particular,
P-DF schemes outperform S-DF in low BER situations, whereas
for moderate to higher BER levels S-DF systems are less
affected by error propagation.

Fig. 4. BER convergence performance of SG algorithms ($E_b/N_0 = 15$ dB, $f_dT = 0.001$).

Fig. 5. BER convergence performance of RLS algorithms.

Another important conclusion from our studies is that RLS
algorithms can deal with faster fading rates and effectively
accommodate more users in the system at the cost of a
quadratic complexity, whereas SG techniques cannot deal with
large systems or very high load ($K/N$ close to 1). Because
the scenario in the experiments assumed ideal power control,
the SG algorithms present a good convergence performance,
although for scenarios without power control (near-far situations)
the performance of these algorithms is subject to the
eigenvalue spread of the covariance matrix of the received
vector $r(i)$. Specifically, when the eigenvalue spread of the
covariance matrix of the received vector $r(i)$ is large, SG
algorithms perform poorly, whereas the rate of convergence
of RLS algorithms is invariant to such a situation [29]. Hence,
for large systems or those that do not have good power control,
RLS recursions are the most appropriate.

Let us now consider the proposed SPA-DF and the combined
iterative DF systems, namely ISPAP-DF and ISPAS-DF.
Simulation experiments with RLS algorithms were conducted
to determine how many arbitrated branches should be used and
to account for the impact of additional branches upon performance.
We designed the novel DF receivers with $L = 2, 4, 8$
parallel branches and compared their BER performance with
the existing ISS-DF and ISP-DF structures, as depicted in
Fig. 6. The results show that the novel SPA-DF, ISPAP-DF
and ISPAS-DF significantly outperform the ISS-DF and ISP-DF
structures, and their performances improve as the number
of parallel branches increases. In this regard, we also notice
that the gains of performance obtained through additional
branches decrease as additional branches are added, resulting
in marginal improvements for more than $L = 4$ branches. For
this reason, we adopt $L = 4$ for the remaining experiments
because it presents a very attractive trade-off between performance
and complexity. Another conclusion from the curves
in Fig. 6 is that the proposed SPA-DF, ISPAP-DF and ISPAS-DF
receiver techniques obtain substantial gains in performance
over existing iterative DF techniques, namely, the ISP-DF and
the ISS-DF of [11].

Fig. 6. BER convergence performance of RLS algorithms with iterative
receivers for different numbers of parallel branches $L$ at $E_b/N_0 = 15$ dB
in a slow fading environment: (a) CCM, (b) CMV ($L=2$: dashed line, $L=4$:
solid line and $L=8$: dash-dotted line).



B. BER Performance versus $E_b/N_0$, $K$ and User Index

In this part, the BER performance of the different receiver 
techniques is further investigated and the receivers process 
2000 symbols to obtain the curves. In particular, the average 
BER performance of the receivers versus Eb/No and number 
of users (K) is depicted in Figs. 7 and 8, whereas the individual 
BER performance versus the user indices is shown in Figs. 9 
and 10. 

A comparison of the CMV and the CCM design criteria with
RLS algorithms is carried out in experiments whose results are
shown in Figs. 7 and 8. The curves reveal that DF detectors
are significantly superior to linear receivers and that the CCM-RLS
algorithm outperforms the CMV-RLS techniques in all
situations. With respect to the performance, the best results
are obtained with the ISPAP-DF receiver structure, which can
save up to 2.5 dB for the same BER as compared to the
iterative receivers of [11] (ISP-DF and ISS-DF). In comparison
with linear receivers, the proposed ISPAP-DF system obtains
savings of up to 7 dB for the same BER. In general, the
curves in Figs. 7 and 8 reveal that the novel iterative arbitrated
DF schemes, namely the SPA-DF, ISPAP-DF and ISPAS-DF,
can offer considerable gains as compared to existing DF and
linear receivers and support systems with higher loads, through
mitigation of the effects of error propagation.

[Figure panel titles: (a) $K=10$ users, $f_dT=0.0025$; (b) $E_b/N_0=15$ dB, $f_dT=0.0025$; (a) $K=8$ users, $E_b/N_0=15$ dB, $f_dT=0.001$; (b) $K=12$ users, $E_b/N_0=15$ dB, $f_dT=0.0025$]

Fig. 7. Performance of CMV-RLS algorithms in a dynamic environment in
terms of BER versus (a) $E_b/N_0$ with $K = 10$ users and (b) number of users
($K$) at $E_b/N_0 = 15$ dB.

Fig. 8. Performance of CCM-RLS algorithms in a dynamic environment in
terms of BER versus (a) $E_b/N_0$ with $K = 10$ users and (b) number of users
($K$) at $E_b/N_0 = 15$ dB.

The last two scenarios, shown in Figs. 9 and 10, consider
the individual BER performance of the users. From the results,
we observe that a disadvantage of S-DF relative to P-DF is
that it does not provide uniform performance over the user
population. We also notice that for the S-DF receivers, user
1 achieves the same performance as its linear receiver
counterpart, and as the successive cancellation is performed
users with higher indices benefit from the interference
cancellation. The same non-uniform performance is verified for
the proposed SPA-DF, the existing ISS-DF and the novel
ISPAS-DF. Conversely, the new ISPAP-DF, the existing P-DF
and the existing ISP-DF provide uniform performance over
the users, which is an important goal for the uplink of DS-CDMA
systems. In particular, the novel ISPAP-DF detector
achieves the best performance of the analyzed structures and
is significantly superior to the ISP-DF and to the P-DF, which
suffers from error propagation.

Fig. 9. Performance of the receivers in a fading environment in terms of
individual BER versus user index for (a) SG and (b) RLS algorithms.

Fig. 10. Performance of the receivers in terms of individual BER versus
user index for (a) CCM-RLS and (b) CMV-RLS algorithms.

VII. Conclusions 

Blind adaptive SG and RLS type algorithms based on the
CMV and CCM performance criteria were developed for
estimating the parameters of DF receivers in uplink scenarios
with multipath. The CCM-based blind algorithms have shown
a performance that is very close to that of trained algorithms
without the need for pilot channels. A novel SPA-DF structure
was presented and combined with iterative techniques for
use with cascaded DF stages, resulting in new iterative DF
schemes, the ISPAS-DF and the ISPAP-DF, that can offer
substantial gains in performance over existing linear and
DF detectors and mitigate more effectively the deleterious
effects of error propagation. In particular, the proposed ISPAP-DF
structure has achieved the best performance amongst all
analyzed receivers and is able to provide uniform performance
over the user population.

Appendix I

In what follows, an analysis of the CCM method and its
convergence properties is carried out for the linear receiver
case ($f_k = 0$), extending previous results on its convexity
for both complex and multipath signals. We believe that it
provides a good starting point (better than the CMV design)
for performing the cancellation of the associated users by the
feedforward section of the DF-CCM receiver.

Let us express the cost function $J_{CM} = E[(|w_k^H r|^2 - 1)^2]$
as $J_{CM} = (E[|z_k|^4] - 2E[|z_k|^2] + 1)$, drop the time index
$(i)$ for simplicity, assume a stationary scenario and that $b_k$,
$k = 1, \ldots, K$ are statistically independent i.i.d. complex random
variables with zero mean and unit variance, and that $b_k$ and $n$ are
statistically independent. Let us also define $x = \sum_{k=1}^{K} A_k b_k \tilde{s}_k$,
$C_k h_k = \tilde{s}_k$, $Q = E[x x^H]$, $P = E[\eta\eta^H]$, $R = Q + P + \sigma^2 I$
and alternatively express the received vector by $r(i) = x(i) +
\eta(i) + n(i)$, where $\eta(i)$ is the ISI. Considering user 1 as
the desired one we let $w_1 = w$ and define $u_k = A_k^* \tilde{s}_k^H w$,
$u = A^H \tilde{S}^H w = [u_1 \ldots u_K]^T$ where $\tilde{S} = [\tilde{s}_1 \ldots \tilde{s}_K]$,
$A = \mathrm{diag}(A_1 \ldots A_K)$ and $b = [b_1 \ldots b_K]^T$. Using the constraint
$C_1^H w = \nu h_1$ we have for the desired user the condition
$u_1 = (A_1^* \tilde{s}_1^H) w = A_1^* h_1^H C_1^H w = \nu A_1^* h_1^H h_1$. In the absence
of noise and neglecting ISI, the (user 1) cost function can be
expressed as

$$J_{CM}(w) = E[(u^H b b^H u)^2] - 2E[(u^H b b^H u)] + 1 = 8\Big(D + \sum_{k=2}^{K} u_k u_k^*\Big)^2 - 4D^2 - 4\sum_{k=2}^{K} (u_k u_k^*)^2 - 4D - 4\sum_{k=2}^{K} (u_k u_k^*) + 1,$$

where $D = u_1 u_1^* = \nu^2 |A_1|^2 |h_1^H h_1|^2$.
To examine the convergence properties of the optimization
problem in (10), we proceed similarly to [16]. Under the
constraint $C_1^H w = \nu h_1$, we have:

$$J_{CM}(w) = \tilde{J}_{CM}(\bar{u}) = 8(D + \bar{u}^H\bar{u})^2 - 4D^2 - 4\sum_{k=2}^{K} (u_k u_k^*)^2 - 4D - 4(\bar{u}^H\bar{u}) + 1 \qquad (42)$$

where $\bar{u} = [u_2, \ldots, u_K]^T = Bw$, $B = A'^H \tilde{S}'^H$, $\tilde{S}' =
[\tilde{s}_2 \ldots \tilde{s}_K]$ and $A' = \mathrm{diag}(A_2 \ldots A_K)$. To evaluate the
convexity of $\tilde{J}_{CM}(\cdot)$, we compute its Hessian ($H$) using the rule
$H = \frac{\partial}{\partial \bar{u}^H} \frac{\partial \tilde{J}_{CM}(\bar{u})}{\partial \bar{u}}$ that yields:

$$H = 16(D - 1/4) I + 16 \bar{u}^H\bar{u} I + 16 \bar{u}\bar{u}^H - 16\,\mathrm{diag}(|u_2|^2 \ldots |u_K|^2) \qquad (43)$$

Specifically, $H$ is positive definite if $a^H H a > 0$ for all
nonzero $a \in \mathbb{C}^{(K-1)\times 1}$. The second, third and fourth
terms of (43) yield the positive definite matrix
$16\big(\bar{u}\bar{u}^H + \mathrm{diag}\big(\sum_{k=3}^{K} |u_k|^2, \sum_{k=2,k\neq 3}^{K} |u_k|^2, \ldots, \sum_{k=2,k\neq K}^{K} |u_k|^2\big)\big)$,
whereas the first term provides the condition
$\nu^2 |A_1|^2 |h_1^H h_1|^2 > 1/4$ that ensures the convexity of
$\tilde{J}_{CM}(\cdot)$ in the noiseless case. Because $\bar{u} = Bw$ is a
linear function of $w$, then $\tilde{J}_{CM}(\bar{u})$ being a convex function
of $\bar{u}$ implies that $J_{CM}(w) = \tilde{J}_{CM}(Bw)$ is a convex
function of $w$. Since the extrema of the cost function can be
considered for small $\sigma^2$ a slight perturbation of the noise-free
case [16], the cost function is also convex for small $\sigma^2$
when $\nu^2 |A_1|^2 |h_1^H h_1|^2 > 1/4$. If we assume ideal channel
estimation ($|h_1^H h_1| = 1$) and $\nu = 1$, our result reduces to
$|A_1|^2 > 1/4$, which is the same found in [31]. For larger
values of $\sigma^2$, we remark that the term $\nu$ can be adjusted
in order to make the cost function $J_{CM}$ in (10) convex, as
pointed out in [16].

Appendix II

To derive a normalized step size for the algorithm in (24)
and (25) let us drop the time index $(i)$ for simplicity and write
the constant modulus cost function $J_{CM} = (|w_k^H r - f_k^H \hat{b}|^2 - 1)^2$
as a function of (24) and (25):

$$J_{CM} = \big(|(P_k(w_k - \mu_w e_k z_k^* r))^H r - f_k^H \hat{b} - \mu_f e_k z_k \hat{b}^H\hat{b} + (C_k (C_k^H C_k)^{-1} h_k)^H r|^2 - 1\big)^2 \qquad (44)$$

If we substitute $P_k = I - C_k (C_k^H C_k)^{-1} C_k^H$ into the first
term of (44) and use $C_k^H w_k = h_k$ we can simplify (44) and
obtain:

$$J_{CM} = (|z_k - \mu_w e_k z_k r^H P_k r - \mu_f e_k z_k \hat{b}^H\hat{b}|^2 - 1)^2 \qquad (45)$$

Next, if we take the gradient of $J_{CM}$ with respect to $\mu_w$ and
equal it to zero, we have:

$$\nabla J_{\mu_w} = 2(|z_k - \mu_w e_k z_k r^H P_k r - \mu_f e_k z_k \hat{b}^H\hat{b}|^2 - 1) \times \frac{d}{d\mu_w} |z_k - \mu_w e_k z_k r^H P_k r - \mu_f e_k z_k \hat{b}^H\hat{b}|^2 = 0 \qquad (46)$$

From the above expression it is clear that this minimization
leads to four possible solutions, namely:

$$\mu_w^1 = \mu_w^2 = \frac{1 - \mu_f e_k \hat{b}^H\hat{b}}{e_k r^H P_k r}, \quad \mu_w^3 = \frac{(|z_k| - 1) - \mu_f |z_k| e_k \hat{b}^H\hat{b}}{|z_k| e_k r^H P_k r}, \quad \mu_w^4 = \frac{(|z_k| + 1) - \mu_f |z_k| e_k \hat{b}^H\hat{b}}{|z_k| e_k r^H P_k r} \qquad (47)$$

By computing the second derivative of (44) one can verify
that it is always positive for the third and fourth solutions
above, indicating the minimum point. It should be remarked
that the solution for $\mu_f$ is analogous to that for $\mu_w$ and leads to the
same relations. Hence, we choose
$\mu_w = \frac{(|z_k| + 1) - \mu_f |z_k| e_k \hat{b}^H\hat{b}}{|z_k| e_k r^H P_k r}$
and introduce again the convergence factors $\mu_{0_w}$ and $\mu_{0_f}$,
so that the algorithms can operate with adequate step sizes
that are usually small to ensure good performance, and
thus we have

$$\mu_w = \mu_{0_w} \frac{(|z_k| + 1) - \mu_f |z_k| e_k \hat{b}^H\hat{b}}{|z_k| e_k r^H P_k r}, \qquad \mu_f = \mu_{0_f} \frac{(|z_k| + 1) - \mu_w |z_k| e_k r^H P_k r}{|z_k| e_k \hat{b}^H\hat{b}}.$$

Appendix III

To derive a normalized step size for the SG algorithm in
(26) and (27) let us drop again the time index $(i)$ for
simplicity and write the minimum variance cost function
$J_{MV} = |w_k^H r - f_k^H \hat{b}|^2$ as:

$$J_{MV} = |(P_k(w_k - \mu_w r x_k^*))^H r - f_k^H \hat{b} - \mu_f x_k \hat{b}^H\hat{b} + (C_k (C_k^H C_k)^{-1} h_k)^H r|^2 \qquad (48)$$

where $x_k = w_k^H r - f_k^H \hat{b}$ is the receiver output, written $z_k$ in (26) and (27).
If we take the gradient of $J_{MV}$ with respect to $\mu_w$ and equal
it to zero, we get:

$$\nabla J_{\mu_w} = 2|(P_k(w_k - \mu_w r x_k^*))^H r - f_k^H \hat{b} - \mu_f x_k \hat{b}^H\hat{b} + (C_k (C_k^H C_k)^{-1} h_k)^H r| \times (-P_k r x_k^*)^H r = 0 \qquad (49)$$

If we substitute $P_k = I - C_k (C_k^H C_k)^{-1} C_k^H$ into the first
term of (49) and use $C_k^H w_k = h_k$ we can eliminate the third
term of (49) and obtain the solution:

$$\mu_w = \frac{1}{r^H P_k r} \qquad (50)$$

Note that we introduce again a convergence factor $\mu_{0_w}$, so that
the algorithm can operate with adequate step sizes that are
usually small to ensure good performance, and thus we have
$\mu_w = \mu_{0_w} / (r^H P_k r)$. Next, we take the gradient of $J_{MV}$ with
respect to $\mu_f$ and equal it to zero:

$$\nabla J_{\mu_f} = 2|(P_k(w_k - \mu_w r x_k^*))^H r - f_k^H \hat{b} - \mu_f x_k \hat{b}^H\hat{b} + (C_k (C_k^H C_k)^{-1} h_k)^H r| \times (-x_k \hat{b}^H\hat{b})^H r = 0 \qquad (51)$$

where it is noticed that the conditions are the same as for $\mu_w$.
Thus we proceed similarly to obtain the step size, which is
given by $\mu_f = (1 - \mu_w r^H P_k r) / (\hat{b}^H\hat{b})$. Remark again that a
convergence factor $\mu_{0_f}$ is applied so that the algorithm can operate
with adequate step sizes that are usually small to ensure good
performance, and thus we employ
$\mu_f = \mu_{0_f} (1 - \mu_w r^H P_k r) / (\hat{b}^H\hat{b})$.

Appendix IV

Here, we discuss the suitability of the matrix $R_k$, that arises
from the CCM method, for use in channel estimation. From
the analysis in Appendix I for the linear receiver, we have
for an ideal and asymptotic case that $u_k = (A_k^* \tilde{s}_k^H) w_1 \approx 0$,
for $k = 2, \ldots, K$. Then, $w_1^H r \approx A_1 b_1 w_1^H \tilde{s}_1 + w_1^H n$ and
$|w_1^H r|^2 \approx A_1^2 |w_1^H \tilde{s}_1|^2 + A_1 b_1 (w_1^H \tilde{s}_1) n^H w_1 +
A_1 b_1^* (\tilde{s}_1^H w_1) w_1^H n + w_1^H n n^H w_1$. Therefore, we have for the
desired user (i.e. user 1):

$$R_1 = E[|w_1^H r|^2 r r^H]$$
$$\approx A_1^2 |w_1^H \tilde{s}_1|^2 R + A_1 w_1^H \tilde{s}_1 E[b_1 n^H w_1 r r^H] + A_1 \tilde{s}_1^H w_1 E[b_1^* w_1^H n r r^H] + E[w_1^H n n^H w_1 n n^H] + \sigma^2 Q w_1^H w_1$$
$$\approx A_1^2 |w_1^H \tilde{s}_1|^2 R + A_1 w_1^H \tilde{s}_1 E[b_1 n^H w_1 r r^H] + E[w_1^H n n^H w_1 n n^H] + A_1 \tilde{s}_1^H w_1 E[b_1^* w_1^H n r r^H] + \sigma^2 (R - \sigma^2 I) w_1^H w_1$$
$$\approx \alpha R + N \qquad (52)$$

where $R = Q + \sigma^2 I$, $Q = E[x x^H] = \sum_{k=1}^{K} |A_k|^2 \tilde{s}_k \tilde{s}_k^H$.
From (52), it can be seen that $R_1$ can be approximated by
$R$ multiplied by a scalar factor $\alpha$ plus a noise-like term $N$,
that for sufficient $E_b/N_0$ has an insignificant contribution. In
addition, when the symbol estimates $z_k = w_k^H r$ are reliable,
that is the cost function in (10) is small ($J_{CM} \ll 1$), then
$|z_k|^2$ has small variations around unity for both linear and DF
detectors (note that $z_k = w_k^H r - f_k^H \hat{b}$ for the DF receivers),
yielding the approximation

$$E[|z_k|^2 r r^H] = E[r r^H] + E[(|z_k|^2 - 1) r r^H] \approx E[r r^H] = R \qquad (53)$$

Therefore, we conclude that the channel estimation can be
performed on $R_k$ in lieu of $R$, since the properties of the
matrix $R$ studied in [26], [23] hold for $R_k$.